A writing system comprises a set of , called a script, as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independently invented writing system gradually emerged from a system of proto-writing, where a small number of were used in a manner incapable of fully encoding language, and thus lacking the ability to express a broad range of ideas.
Writing systems are generally classified according to how their symbols, called , relate to units of language. Phonetic writing systemswhich include and graphemes that correspond to sounds in the corresponding spoken language. Alphabets use graphemes called letters that generally correspond to spoken . They are typically divided into three sub-types: Pure alphabets use letters to represent both consonant and vowel sounds, generally only use letters representing consonant sounds, and use letters representing consonant–vowel pairs. Syllabaries use graphemes called that represent entire or moras. By contrast, logographic (or morphographic) writing systems use graphemes that represent the units of meaning in a language, such as its or . Alphabets typically use fewer than 100 distinct symbols, while syllabaries and logographies may use hundreds or thousands respectively.
The relationship between writing and language more broadly has been the subject of philosophical analysis as early as Aristotle (384–322 BC). While the use of language is universal across human societies, writing is not; writing emerged much more recently, and was independently invented in only a handful of locations throughout history. While most spoken languages have not been written, all written languages have been predicated on an existing spoken language. When those with signed languages as their first language read writing associated with a spoken language, this functions as literacy in a second, acquired language.; citing A single language (e.g. Hindustani) can be written using multiple writing systems, and a writing system can also represent multiple languages. For example, Chinese characters have been used to write multiple languages throughout the the Vietnamese language from at least the 13th century, until their replacement with the Latin-based Vietnamese alphabet in the 20th century.
In the first several decades of modern linguistics as a scientific discipline, linguists often characterized writing as merely the technology used to record speechwhich was treated as being of paramount importance, for what was seen as the unique potential for its study to further the understanding of human cognition.
A grapheme is the basic functional unit of a writing system. Graphemes are generally defined as minimally significant elements that, when taken together, comprise the set of symbols from which texts may be constructed. All writing systems require a set of defined graphemes, collectively called a script. The concept of the grapheme is similar to that of the phoneme in the study of spoken languages. Likewise, as many sonically distinct phones may function as the same phoneme depending on the speaker, dialect, and context, many visually distinct (or graphs) may be identified as the same grapheme. These variant glyphs are known as the of a grapheme: For example, the lowercase letter may be represented by the double-storey and single-storey shapes, or others written in cursive, block, or printed styles. The choice of a particular allograph may be influenced by the medium used, the writing instrument used, the stylistic choice of the writer, the preceding and succeeding graphemes in the text, the time available for writing, the intended audience, and the largely unconscious features of an individual's handwriting.
Orthography () refers to the rules and conventions for writing shared by a community, including the ordering of and relationship between graphemes. Particularly for , orthography includes the concept of spelling. For example, English orthography includes uppercase and lowercase forms for 26 letters of the Latin alphabet (with these graphemes corresponding to various phonemes), punctuation marks (mostly non-phonemic), and other symbols, such as numerals. Writing systems may be regarded as complete if they are able to represent all that may be expressed in the spoken language, while a partial writing system cannot represent the spoken language in its entirety.
Writing has been invented independently multiple times in human historyfirst emerging as cuneiform, a system initially used to write the Sumerian language in southern Mesopotamia; it was later adapted to write Akkadian as its speakers spread throughout the region, with Akkadian writing appearing in significant quantities . Cuneiform was closely followed by Egyptian hieroglyphs. It is generally agreed that the two systems were invented independently from one another; both evolved from proto-writing systems between 3400 and 3100 BC, with the earliest coherent texts dated . Chinese characters emerged independently in the Yellow River valley . There is no evidence of contact between China and the literate peoples of the Near East, and the Mesopotamian and Chinese approaches for representing sound and meaning are distinct. The Mesoamerican writing systems, including Olmec and the Maya script, are likewise associated with an independent invention.
With each independent invention of writing, the ideographs used in proto-writing were decoupled from the direct representation of ideas, and gradually came to represent words instead. This occurred via application of the rebus principle, where a symbol was appropriated to represent an additional word that happened to be similar in pronunciation to the word for the idea originally represented by the symbol. This allowed words without concrete visualizations to be represented by symbols for the first time; the gradual shift from ideographic symbols to those wholly representing language took place over centuries, and required the conscious analysis of a given language by those attempting to write it.
The Indus script (), found on different types of artefacts produced by the Indus Valley Civilization on the Indian subcontinent, remains undeciphered, and whether it functioned as true writing is not agreed upon. While its origins are not visually obvious, the opportunity for Mesopotamian cultural diffusion to have introduced the concept of writing to the Indus peoples is clear.
Alphabetic writing descends from previous morphographic writing, and first appeared to write a Semitic language spoken in the Sinai Peninsula. Most of the world's alphabets either descend directly from this Proto-Sinaitic script, or were directly inspired by its design. Descendants include the Phoenician alphabet (), and its child in the Greek alphabet (). The Latin alphabet, which descended from the Greek alphabet, is by far the most common script used by writing systems.
Many classifications define three primary categories, where phonographic systems are subdivided into syllabic and alphabetic (or segmental) systems. Syllabaries use symbols called syllabograms to represent or moras. Alphabets use symbols called letters that correspond to spoken phonemes (or more technically, to ). Alphabets are generally classified into three subtypes, with having letters for , pure alphabets having letters for both consonants and , and having characters that correspond to consonant–vowel pairs. David Diringer proposed a five-fold classification of writing systems, comprising pictographic scripts, ideographic scripts, analytic transitional scripts, phonetic scripts, and alphabetic scripts.
In practice, writing systems are classified according to the primary type of symbols used, and typically include exceptional cases where symbols function differently. For example, logographs found within phonetic systems like English include the ampersand and the numerals , , etc.which correspond to specific words ( and, zero, one, etc.) and not to the underlying sounds. Most writing systems can be described as mixed systems that feature elements of both phonography and morphography.
Logograms are sometimes conflated with , symbols which graphically represent abstract ideas; most linguists now reject this characterization. Chinese characters are often semantic–phonetic compounds, which include a component related to the character's meaning, and a component that gives a hint for its pronunciation.
An abugida is a type of alphabet with symbols corresponding to consonant–vowel pairs, where basic symbols for each consonant are associated with an inherent vowel by default, and other possible vowels for each consonant are indicated via predictable modifications made to the basic symbols. In an abugida, there may be a sign for k with no vowel, but also one for ka (if a is the inherent vowel), and ke is written by modifying the ka sign in a way consistent with how la would be modified to get le. In many abugidas, modification consists of the addition of a vowel sign; other possibilities include rotation of the basic sign, or addition of .
While true syllabaries have one symbol per syllable and no systematic visual similarity, the graphic similarity in most abugidas stems from their origins as abjadswith added symbols comprising markings for different vowels added onto a pre-existing base symbol. The largest single group of abugidas is the Brahmic family of scripts, however, which includes nearly all the scripts used in India and Southeast Asia. The name abugida was derived by linguist Peter T. Daniels () from the first four characters of an order of the Geʽez script, which is used for certain Nilo-Saharan and Afro-Asiatic languages of Ethiopia and Eritrea.
As hangul was consciously created by literate experts, Daniels characterizes it as a "sophisticated grammatogeny"a writing system intentionally designed for a specific purpose, as opposed to having evolved gradually over time. Other featural grammatogenies include developed by professionals and constructed scripts created by hobbyists and creatives, like the Tengwar script designed by J. R. R. Tolkien to write the Elven languages he also constructed. Many of these feature advanced graphic designs corresponding to phonological properties. The basic unit of writing in these systems can map to anything from phonemes to words. It has been shown that even the Latin script has sub-character features.
In Braille, raised bumps on the writing substrate are used to encode non-linear symbols. The original systemwhich Louis Braille (1809–1852) invented in order to allow people with visual impairments to read and writeused characters that corresponded to the letters of the Latin alphabet. Moreover, that Braille is equivalent to visual writing systems in function demonstrates that the phenomenon of writing is fundamentally spatial in nature, not merely visual.
In left-to-right scripts (LTR), horizontal rows are sequenced from top to bottom on a page, with each row read from left to right. Right-to-left scripts (RTL), which use the opposite directionality, include the Arabic alphabet.
Egyptian hieroglyphs were written either left-to-right or right-to-left, with the animal and human glyphs turned to face the beginning of the line. The early alphabet did not have a fixed direction, and was written both vertically and horizontally; it was most commonly written : starting in one horizontal direction, then turning at the end of the line and reversing direction.
The right-to-left direction of the Phoenician alphabet initially stabilized after . Left-to-right writing has an advantage that, since most people are right-handed, the hand does not interfere with what is being written (which, when inked, may not have dried yet) as the hand is to the right side of the pen. The Greek alphabet and its successors settled on a left-to-right pattern, from the top to the bottom of the page. Other scripts, such as Arabic alphabet and Hebrew alphabet, came to be written right to left. Scripts that historically incorporate Chinese characters have traditionally been written vertically in columns arranged from right to left, while a horizontal direction from left to right was only widely adopted in the 20th century due to Western influence.
Several scripts used in the Philippines and Indonesia, such as Hanunoo script, are traditionally written with lines moving away from the writer, from bottom to top, but are read left to right; ogham is written from bottom to top, commonly on the corner of a stone. The ancient Libyco-Berber alphabet was also written from bottom to top.
+ Polygraphy in English ! scope="col" | Phoneme ! scope="col" | Grapheme ! scope="col" | Example |
hail | |||
bay | |||
steak | |||
veil |
+ Polyphony in English ! scope="col" | Phoneme ! scope="col" | Grapheme ! scope="col" | Example |
! scope="row" rowspan="5" style="text-align:center" | saute | ||
red | |||
area | |||
taken | |||
smile |
Writing systems, especially alphabets, often include characters that can represent multiple sound values, or conversely sound values that can be represented by multiple charactersthis phenomenon is referred to as polyvalence. Orthographies with lower or higher polyvalence are referred to as shallow or deep respectively. While polyvalent graphemes are often perceived as defects, they can serve to distinguish homophonic words, and to indicate etymological or semantic connections between words not clear from pronunciation alonee.g. between English sign and signal or child and children. Specifically, an orthographic relationship where one grapheme may represent multiple sound values can be termed polyphony, while a relationship where one sound value may be represented by multiple graphemes can be termed polygraphy.
Scholars have increasingly analysed different patterns of phonological spellings versus morphological spellings in a writing system as being better suited depending on the characteristics of the spoken language, with neither principle being ideal in all circumstances. While not adhering strictly to phonological rules, morphological spellings often follow other patterns that allow for transparent identification and parsing by readers and writers.
|
|